|
|
Accession Number |
TCMCG075C00066 |
gbkey |
CDS |
Protein Id |
XP_007046491.2 |
Location |
join(240489..240614,240713..240764,240857..240904,241012..241166,241264..241321,241827..241884,242038..242130,242216..242381,242473..242488,242580..242800,243349..243418,243511..243573,243724..245145,245222..245300,245404..245513,245751..245824,245931..246128,246665..246709,246783..246836,246933..247039,247289..247337,247435..247494,247611..247683,248023..248108,248250..248423) |
Gene |
LOC18610643 |
GeneID |
18610643 |
Organism |
Theobroma cacao |
|
|
Length |
1218aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007046429.2
|
Definition |
PREDICTED: DNA mismatch repair protein MLH3 isoform X1 [Theobroma cacao] |
CDS: ATGGGGAGCATTAAGCCCTTGCCAGAGGCTGTTCGTAGTTCGGTGCGTTCTGCCATTATATTGTTTGACTTGACTAGGGTTGTGGAGGAGCTCATTTTCAACAGCCTCGATGCTTCTGCTTCAAAGGTGTCAGTCTTTGTAAGTGTCGGGAGCAGCTATGTCAAAGTGGTGGATGATGGATCTGGTATATCTCGTGATGGATTGGTGTCACTGGGAGAAAGATATGTAACATCAAAGCTTTACCATCTGGGTGATTTGGATGCTGCCAGCAGGAGCTTTGGCTTTCGGGGAGAAGCACTGGCTTCTATATCTGATGTAGCCTTGGTGGAAATAATAACAAAAGCTTACGGAAAGCCAAATGGGTACCGCAAGGTCATTAAGGGATCCAAGTGTTTGTATCTTGGAATTGATGATGATAGGAAAGATGCAGGTACAACAGTTGTCGTGCGTGATTTATTTTACAACCAACCTGTTCGGAAGAAGCATATGCAATCCTGCCCTAAGAAGGTGTTGCACTCAGTTAAAAAGTGCGTATTCAGAATGGCCCTTGTGCACCCAATGGTTTACTTCAATGTGATTGATATTGAAAGTGAGGATGAGCTTCTCAGTACGCATCCTTCCTCTTCTCCTTTGTCACTTTTAATGAGTGGTTTTGGGATTGAGGACTGTACCTCTCTGCAGAAGCTGAATGCTGATGATGGTTCCCTCAAGCTTTCTGGCTACATAACTGGCTCCTGGGACAATTTTGCTGTTAAGGCCTTTCAATTTGTTTATATCAATTCAAGGTTTGTCTGCAAGGGTCCCATTCATAAGTTGCTGAACAACTTGGCCACTAGTTTTGAGTCTTTAGATTCAAAGAAGGCTAACAACTGGACCAAGAAAGGAAAGAGGAGTAGACCTCAAGTATTTCCGTCCTACATACTGAATATTAGTTGCCCTCCTTCTTTCTATGATTTAACCTTAGAACCATCAAAGACATATGTTGAATTCAAGGATTGGGCATCTATACTTACCTTAATTGAGAAGACAATTCAACACCTCTGGAGGAAAAATATTTGTCGTGCCAATGGATTAGGACAAGCTGAAACTTTGAAGGAAGATGACAATATCTTACATGTGGAAGAAGATTTTTTTGATGAAGGACCATCTGTGGACTCAGAATTTGCAACAAGGAAACGTTGGACTCAAAAATATCGGCCTTCTTCTTCATTAGAGAAGCTAACAACAGATCATTTGTTTCTTACAGACCATGAAGATATTCCATTTGAGGAGTGCCATGTGAATAATGCACAATTTAGAGATCAACAAAACAATATGAAATTTGTTCATTGGACTGACTATTCTTTTCAAAGTTGGGATGATTCCCTTGTCAAAGGCACATCCTCAGTATTTGAAAGGAGTGATTGTTGTCTTTTGACAACTAATAACAATTCTTTAGTTGAGGATTACTTCTTGGAAAATAGATTCACTGCTTCAGGAAGATCAAACTGTCATGTGAACAACAATGGTATATGTTCAAAGTTAGGTAATGCATCCGATGTGGTTGAGAGTGATGTGACCAATGGAACAGATAGGAACATATTTCCTTTTGATTATCATGAACATTACAATGACTCACAGTTCAGAAAGAATATCAGCAAGCCTTTTCTGCAAAGTTGCTCCTCCCAAAGAACCTTGCCACTTGACAGGGAGTTGGTTGAAAGTGAGAAAGGAATTGAACCACCAATGGATAGCTTTAAGACCAAAGCGAAGCAGGTTTGCTCAAATGAAAGGTTCAATATGCTGAAAACTGATTCCAGTGATCAGACCATGTGGCAGGATGGAGGACCATGCGGTCAAATTTATCCCAAACTTGTAAGTAAAGGTGGGATTGCTAGAGATTTGGATGTTCTAACAAGGGCTTCTGCCAAATCGTTCCTGTCATGTGGAGATGTCTCTATTGAAGAGAATGGCCTTCCATCTGATTCAGTCACACCAATAGAAAAAACTGGCTCTGGTCATCAGTCCTTAAGTTCTGAATGGTGTTCAGGAACCTCTAATCCCTTTGAGCAGTTCAGTTATAAAAATCCAATTGAAGGGTGCTTCAGATCTGAAGAAAGGACCAACTTTGGGCATTTCTCTGCTGGTGAAGATGAGGACTACCAATTTAGCTTTGACCTAATCTCAAGGAGCTCCAGCCAAGAAAAATGCATCTATGATTGTCCAAACACTGGACTAGAAATTGACTATGCCAAATCTAGTAGAGATTTTCATGGATTCCTTCAACAATACAATCTAAATCATACATTTTCTCCAGAAGATTCCAATGTAGCAATTGAAGAGAGAGACTGGTTGTGTACAGACTCAAGTATTAATGAATATAAAAGACAAATCGATTGGTTTCAATATCAAGATGTTGAACAAAATCCTATTCCTAAAGAAAGAGCAAGAAGAAGCCAGTCAGCTCCTCCATTTTGCAGCTACAAGAGGAGGTTTATCTCCTTACATCATTGTTTGGCATCAGGGGAACCCACTTTTAGTGAAGTCCGTGGTCCATTCACTTCTCCAGAGATTGGTGAGAAGAAGCCTCCCCAACAATCTTCTGGTGTGGACAATCTACATTTTGAACCAAGTTTTGGAAAGAATAGATCAAATATGAATAACAAGCCAAACATGGTGTTCAGCACTGTAGTTCGAAAATGTGAAGACATTGAACAACCTCATTGCCTAGAGGGTCCTGAATCAGCTCCGGTGCAAGTATTTATCTCAAAGGGAAATCAGGATCCAGCAAATTCTGGAACCAAATGGCGGAGTGGTTTTGCACAGAATACAAGCAACAGCAAATTATGTGATATTGACTATGAATATAATGTACTTGACATTGCGTCCGGATTGCCCTTTGTTGCCACTAAATCATTGGTTCCTGAATCTATCAATAAGAATTGTCTCAGAGATGCCAAGGTTCTGCAACAGGTGGATAAGAAATTCATCCCAATTGTAGCTGGCGGAACACTTGCTATTATTGATCAGCATGCGGCAGATGAAAGAATTCAACTAGAAGAACTTCGACAAAAGGTTTTATCTGGGAAAGGGAAGACAGTCACCTATTTGGATACAGAGCAAGAGCTGATCCTGCCAGAGATTGGCTATCAGTTACTGCACAATTATTCTGAACAAATAAGAAATTGGGGTTGGATCTGTGACATTCACACCCAAGATTCAAAGCCCTTCAAGAAGAATTTGAACCTTATTCGTCGTAAGCCGGCTGTTGTCAAACTTCTTGCAGTACCTTGCATTTTAGGTGTCAATTTATCTCATGTTGATCTCCTGGAATTTCTACAACAGCTTGCTGATACAGATGGATCATCAACAATGCCTCCATCAATTATTCGAATTCTTAATTCTAAAGCATGCAGAGGTGCAATTATGTTTGGAGACTCCTTGCTACCTTCAGAATGTTCCTTAATTGTTGAAGAGCTGAAGCAGACGTCCCTGTGCTTCCAATGTGCTCATGGGCGACCAACCACTGTCCCGGTTGTGAAGTTGGAGGCATTGCATAGGCAGATAGCTAAAATGCAAATGAAGGATGGTGGTCCAAGGGAATTGTGGCACGGGCTATGTCGACACAGAGTCAGCCTTGAACGAGCCAGCTTGCGCTTAAGTGCAGCTGGAGGTTAG |
Protein: MGSIKPLPEAVRSSVRSAIILFDLTRVVEELIFNSLDASASKVSVFVSVGSSYVKVVDDGSGISRDGLVSLGERYVTSKLYHLGDLDAASRSFGFRGEALASISDVALVEIITKAYGKPNGYRKVIKGSKCLYLGIDDDRKDAGTTVVVRDLFYNQPVRKKHMQSCPKKVLHSVKKCVFRMALVHPMVYFNVIDIESEDELLSTHPSSSPLSLLMSGFGIEDCTSLQKLNADDGSLKLSGYITGSWDNFAVKAFQFVYINSRFVCKGPIHKLLNNLATSFESLDSKKANNWTKKGKRSRPQVFPSYILNISCPPSFYDLTLEPSKTYVEFKDWASILTLIEKTIQHLWRKNICRANGLGQAETLKEDDNILHVEEDFFDEGPSVDSEFATRKRWTQKYRPSSSLEKLTTDHLFLTDHEDIPFEECHVNNAQFRDQQNNMKFVHWTDYSFQSWDDSLVKGTSSVFERSDCCLLTTNNNSLVEDYFLENRFTASGRSNCHVNNNGICSKLGNASDVVESDVTNGTDRNIFPFDYHEHYNDSQFRKNISKPFLQSCSSQRTLPLDRELVESEKGIEPPMDSFKTKAKQVCSNERFNMLKTDSSDQTMWQDGGPCGQIYPKLVSKGGIARDLDVLTRASAKSFLSCGDVSIEENGLPSDSVTPIEKTGSGHQSLSSEWCSGTSNPFEQFSYKNPIEGCFRSEERTNFGHFSAGEDEDYQFSFDLISRSSSQEKCIYDCPNTGLEIDYAKSSRDFHGFLQQYNLNHTFSPEDSNVAIEERDWLCTDSSINEYKRQIDWFQYQDVEQNPIPKERARRSQSAPPFCSYKRRFISLHHCLASGEPTFSEVRGPFTSPEIGEKKPPQQSSGVDNLHFEPSFGKNRSNMNNKPNMVFSTVVRKCEDIEQPHCLEGPESAPVQVFISKGNQDPANSGTKWRSGFAQNTSNSKLCDIDYEYNVLDIASGLPFVATKSLVPESINKNCLRDAKVLQQVDKKFIPIVAGGTLAIIDQHAADERIQLEELRQKVLSGKGKTVTYLDTEQELILPEIGYQLLHNYSEQIRNWGWICDIHTQDSKPFKKNLNLIRRKPAVVKLLAVPCILGVNLSHVDLLEFLQQLADTDGSSTMPPSIIRILNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGRPTTVPVVKLEALHRQIAKMQMKDGGPRELWHGLCRHRVSLERASLRLSAAGG |